https://nova.newcastle.edu.au/vital/access/ /manager/Index ${session.getAttribute("locale")} 5 An Empirical Investigation of Incident Triage for Online Service Systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:43502 Wed 21 Sep 2022 10:01:20 AEST ]]> How to mitigate the incident? An effective troubleshooting guide recommendation technique for online service systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:39871 Wed 06 Jul 2022 08:49:22 AEST ]]> How incidental are the incidents? Characterizing and prioritizing incidents for large-scale online service systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:39861 incidental incidents. Our qualitative and quantitative analyses show that incidental incidents are significant in terms of both number and cost. Therefore, it is important to prioritize incidents by identifying incidental incidents in advance to optimize incident management efforts. In particular, we propose an approach, called DeepIP (Deep learning based Incident Prioritization), to prioritizing incidents based on a large amount of historical incident data. More specifically, we design an attention-based Convolutional Neural Network (CNN) to learn a prediction model to identify incidental incidents. We then prioritize all incidents by ranking the predicted probabilities of incidents being incidental. We evaluate the performance of DeepIP using real-world incident data. The experimental results show that DeepIP effectively prioritizes incidents by identifying incidental incidents and significantly outperforms all the compared approaches. For example, the AUC of DeepIP achieves 0.808, while that of the best compared approach is only 0.624 on average.]]> Wed 06 Jul 2022 08:36:19 AEST ]]> Onion: Identifying incident-indicating logs for cloud systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:44029 Wed 05 Oct 2022 15:19:12 AEDT ]]> Multi-task Hierarchical Classification for Disk Failure Prediction in Online Service Systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:53301 Tue 21 Nov 2023 12:04:11 AEDT ]]> NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365 https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:53307 Tue 21 Nov 2023 12:02:23 AEDT ]]> Towards Intelligent Incident Management: Why We Need It and How We Make It https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:41877 Tue 16 Aug 2022 09:42:16 AEST ]]> Cross-dataset time series anomaly detection for cloud systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:48346 Tue 14 Mar 2023 18:46:36 AEDT ]]> NTAM: Neighborhood-temporal attention model for disk failure prediction in cloud platforms https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:43830 Tue 04 Oct 2022 11:25:27 AEDT ]]> Robust log-based anomaly detection on unstable log data https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:42098 Thu 18 Aug 2022 11:43:37 AEST ]]> Neural feature search: a neural architecture for automated feature engineering https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:42085 Thu 18 Aug 2022 11:24:23 AEST ]]> Continuous incident triage for large-scale online service systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:43375 Thu 15 Sep 2022 15:53:31 AEST ]]> ReBucket: a method for clustering duplicate crash reports based on call stack similarity https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:31728 Sat 24 Mar 2018 08:43:31 AEDT ]]> Mining succinct and high-coverage API usage patterns from source code https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:31799 Sat 24 Mar 2018 08:42:48 AEDT ]]> UniParser: A Unified Log Parser for Heterogeneous Log Data https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:46971 Mon 12 Dec 2022 16:19:24 AEDT ]]> Predicting node failure in cloud service systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:36058 Mon 03 Feb 2020 12:34:07 AEDT ]]> Outage Prediction and Diagnosis for Cloud Service Systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:50076 Fri 30 Jun 2023 12:37:14 AEST ]]> Efficient incident identification from multi-dimensional issue reports via meta-heuristic search https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:42143 Fri 19 Aug 2022 13:39:06 AEST ]]> CONAN: Diagnosing Batch Failures for Cloud Systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:53212 Fri 17 Nov 2023 12:05:41 AEDT ]]> How long will it take to mitigate this incident for online service systems? https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:39739 Fri 17 Jun 2022 18:27:13 AEST ]]> HALO: Hierarchy-aware Fault Localization for Cloud Systems https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:39729 Fri 17 Jun 2022 17:52:05 AEST ]]> SPINE: a scalable log parser with feedback guidance https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:54836 Fri 15 Mar 2024 11:52:33 AEDT ]]> Improving service availability of cloud systems by predicting disk error https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:35051 Fri 14 Jun 2019 12:45:40 AEST ]]> An empirical investigation of missing data handling in cloud node failure prediction https://nova.newcastle.edu.au/vital/access/ /manager/Repository/uon:53586 Fri 08 Dec 2023 15:46:14 AEDT ]]>